AMENDMENTS TO THE CLAIMS 



Claims 1-4. (Canceled) 

5. (Currently Amended) A videoconferencing system comprising: 

a conference bridge for interconnecting a plurality of remotely located 
videoconferencing stations; and 

a speaker identification subsystem to determine whether a conferee is speaking 
based, at least in part, on whether visual lip movements ascertained from a video signal
from a conference station at which the conferee is located of said conferee are 
reasonably consistent with an audio signal from [[a]] the conference station in which said
conferee is located, the subsystem also to determine which of [[the]] a plurality of
conferees is speaking the loudest when multiple conferees are speaking simultaneously 
from different conference stations.

6. (Previously Presented) The videoconference system of claim 5 wherein the 
speaker identification subsystem comprises a voice activity detector. 

7-8. (Canceled) 

9. (Previously Presented) The videoconference system of claim 6 wherein said voice 
activity detector includes image analysis and recognition software. 

10. (Previously Presented) The videoconference system of claim 29 wherein said
means for visually altering said image comprises means for highlighting a border around 
said image of said conferee determined to be the loudest speaker. 

11. (Currently Amended) A videoconference station comprising: 

a transmitter to transmit a combined audio video signal to a videoconference 
bridge; and 

a speaker identification subsystem located at said videoconference station to 
determine whether the conferee at said videoconference station is speaking by analyzing 
whether visual lip movements of said conferee ascertained from a video signal at said
station are substantially consistent with an audio signal at said station so as to indicate 
human speech, the subsystem also to determine which of [[the]] a plurality of conferees is
the loudest speaker when multiple conferees are speaking simultaneously from different 
videoconference stations.

12. (Previously Presented) The videoconference station of claim 11 wherein said
speaker identification subsystem comprises a voice activity detector. 

13. (Previously Presented) The videoconference system of claim 12 wherein said 
voice activity detector includes image analysis and recognition software. 

14. (Currently Amended) A method of displaying images of a plurality of conferees
in a videoconference system, comprising: 

determining whether a conferee is speaking by analyzing a consistency between 
visual lip movements of said conferee ascertained from a video signal from a
conference station at which the conferee is located and an audio signal from [[a]] the 
conference station in which said conferee is located such that the combination of lip
movement and audio signal indicates human speech; 

determining which of the conferees is the loudest speaker when multiple 
conferees are speaking simultaneously from different conference stations; and

visually altering an image of said conferee that is displayed to other conferees 
when said conferee is determined to be the loudest speaker. 

15-28. (Canceled) 

29. (Previously Presented) The videoconferencing system of claim 5, further 
comprising: 

means for visually altering an image of said conferee displayed in other 
conference stations if said conferee is determined to be the loudest speaker of the 
plurality of conferees. 

30. (Currently Amended) A videoconferencing system comprising: 

a speaker identification subsystem located at each of a plurality of remotely 
located videoconferencing stations, the subsystem to determine whether a conferee is 
speaking based, at least in part, on whether visual lip movements of the conferee
ascertained from a video signal from a videoconference station at which the conferee 
is located are reasonably consistent with an audio signal from [[a]] the conference 
station in which the conferee is located, and also to determine which of the conferees is
speaking the loudest when multiple conferees are speaking simultaneously; and 

a conference bridge coupled to each of the videoconferencing stations to visually 
alter an image of a conferee who is speaking before combining the image with images 
received from each of the other videoconferencing stations and transmitting a combined 
image back to each of the videoconferencing stations. 



31-34. (Cancelled) 